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INSTRUCTION ENCODING WITHIN A DATA PROCESSING APPARATUS 
HAVING MULTIPLE INSTRUCTION SETS 

This invention relates to the field of data processing systems. More 
particularly, this invention relates to data processing systems having multiple 
instruction sets and the way in which such multiple instruction sets may be encoded. 

It is known to provide data processing systems with multiple instruction sets. 
An example of such data processing systems are processor cores produced by ARM 
Limited of Cambridge, England which support both the ARM and Thumb instruction 
sets. The ARM instruction set is a 32-bit instruction set and the Thumb instruction set 
is a 16-bit instruction set. Whilst data processing systems supporting multiple 
instruction sets allow an advantageous degree of flexibility in the way which program 
operations may be represented and can yield advantages such as improved code 
density, there is typically an increase in the amount of hardware needed to support the 
multiple instruction sets. 

Viewed from one aspect the present invention provides apparatus for 
processing data, said apparatus comprising 

data processing logic operable to perform data processing operations; arid 

an instruction decoder operable to decode program instructions specifying data 
processing operations to be performed by said data processing logic and to control skid 
data processing logic to perform said data, processing operations; wherein 

said instruction decoder is operable in a first mode in which program 
instructions of a first instruction set are decoded and in a second mode in which 
program instructions of a second instruction set are decoded, a subset of program 
instructigns of said first instruction set 

encoding with a subset of program instructions of said second instruction set and 
forming a common subset of instructions representing at least one class of instructions, 
said common subset of instructions controlling said data processing logic to perform 
the same data processing operations independent of whether said instruction decoder is 
operating in said first mode or said second mode. 
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The invention recognises that by arranging the encoding of the instruction sets 
such that a common subset share the same encoding (at least after any variations due to 
storage order, e.g endianness, have been compensated), then such systems can be 
advantageously simplified in their implementation and other aspects of their operation 
improved. As an example, common decoding logic and/common processing logic for 
implementing the processing operations specified may be more readily utilised with a 
reduction in hardware overhead needed to support multiple instruction sets. 

In preferred embodiments of the present invention the class of instructions 
included within the common subset of instructions (class for example being considered 
to be a group of instructions with similar functionality such as the load/store 
instructions, the multiply instructions, etc) include co-processor instructions such that 
the same co-processor logic can be used by both the first instruction set and the second 
instruction set. This is strongly advantageous since many co-processor designs exist 
and may have been developed for only one instruction set and the ability to reuse this 
same co-processor design with a further instruction set represents a significant 
advantage. It is particularly preferred when all co-processor instructions are within the 
common subset such that complete interoperability of a common group of co- 
processors may be provided with either instruction set. 

Whilst it will be appreciated that the first instruction set and the second 
instruction set can have many different characteristics, preferably the first instruction 
set is a fixed length instruction set, conveniently of 32-bit or 16-bit instructions, and 
the second instruction set is a variable length instruction set. 

It will be appreciated" that whilst the common sxibset "of iHStmctioiis" perform 
common data processing operations, it is not necessarily the case that these common 
data processing operations will always produce the same result data values. This does 
not detract from the hardware saving and other advantages mentioned above, but may 
be the result of specific peculiarities associated with certain data values to be used as 
input operands depending upon which instruction set is being utilised. Examples of 
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data values serving as input operands whieh may have instruetion se, specfic 

peculiarities are the program counter value, which may be maintained relative to the 
. instruction being executed to have a different value depending upon the instruct™ se. 

used and/or the program status register vahre which may hold one or more b,.s 
5 indicating which instruction se, is being used and accordingly have different vahres 

depending upon which instruction set is being used. 

Viewed from another aspect the present invention provides a method of 

processing data, said method comprising the steps of: 
10 * performing data processing operations with data processing logic; and 

decoding with an instruction decoder program instructions specifymg dam 
processing operations to be performed by said data processing logic and con.ro.hng 
said data processing logic .o perform said da.a processing operations; wherem 

in a firs, mode program instructions of a first instruction set ate decoded and m 

15 a second mode program instructions of a second instruction se. are decoded, a subset 
of program instructions of said first instruction se. having a common storage order 
compensated encoding with a subset of program instructions of said second instruction 
se, and forming a common subse, of instiuetions representing at leas, one class of 
functions, said common subse, of mictions controlling said data processing logrc 

20 to perform the same data processing operations independent of whether sard 
instruction decode, is operating in said first mode or said second mode. 

Viewed from a further aspect the present invention provides a computer 
program product having a computer program operable ,0 control a data processus 
25 apparatus containing dam processing logic operable to perform data processing 
operations, said computer program comprising: 

- program instructions of a firs, instroc-ion .set and program instrucuons of a 
second instruction se, tha. control said data processing logic ,o perform said data 
processing operations; wherein 
30 a subset of progrom instructions of said firs, instruction se, have a common storage 
order compensated encoding with a subse, of program instructions of said second 
instruction se, and form a common subset of instructions representing at leas, one class 
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of instructions, said common subset of instructions controlling data processing logic to 
perform the same data processing operations independent of whether instructions of 
said first instruction set or of said second instruction set are being decoded. 

An embodiment of the invention will now be described, by way of example 
only, with reference to the accompanying drawings in which: 

Figure 1 schematically illustrates a data processing apparatus using multiple 
instruction sets and having an associated coprocessor; 

Figure 2 illustrates the encoding of BL and BLX instructions in the Thumb 
instruction set; 

Figure 3 illustrates how the encodings of 16-bit and 32-bit instructions are 
distinguished from each other in an enhanced version of the Thumb instruction set; 

Figure 4 illustrates the encodings of coprocessor instructions in the ARM 
instruction set; 

Figure 5 illustrates the encodings of unconditional coprocessor instructions in 
the enhanced version of the Thumb instruction set; 

Figure 6 illustrates the difference in storage order of an unconditional CDP 
coprocessor instruction in the ARM instruction set and the enhanced Thumb 
instruction set; 

Figure 7 illustrates a way in which iristmctioh decoders for the ARM and 
Thumb instruction sets can be designed; 

Figure 8 illustrates a second way in which instruction decoders for the ARM 
and Thumb instruction sets can be designed; 
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Figure 9 illustrates how the instruction decoders of Figure 8 are modified to 
decode the ARM and enhanced Thumb instruction sets; and 

Figure 10 schematically illustrates the architecture of a general purpose 
computer which may implement program instructions in accordance with the above 
described techniques. 

Figure 1 shows a data processing apparatus 2. The data processing apparatus 2 
includes a processor core 3 containing a register bank 4, a multiplier 6, a shifter 8, an 
adder 10, instruction decoders 12 (including a common subset instruction decoder 13), 
an instruction prefetch buffer 14, an instruction prefetch unit 16 and program status 
registers 18. One of the registers within the register bank 4 is a program counter 
register 20 storing an address value having a predetermined relationship to the address 
value of the currently executing program instruction. This relationship can vary 
depending upon which instruction set is being used at the current time, e.g. current 
instruction address plus 4, current instruction address plus 8 etc. It will be appreciated 
that although only one coprocessor is shown in Figure 1, multiple coprocessors could 
be attached, with instructions intended for execution by different coprocessors being 
distinguished by a coprocessor number field in each coprocessor instruction. 

A coprocessor 22 is coupled to the processor core 3 and shares instruction 
words and data words with the processor core 3. The coprocessor 22 is responsive to 
coprocessor instructions within the instruction stream fetched by the instruction 
prefetch unit 16. 

Coprocessor instructions are executed by the processor core 3 in conjunction 
with the co^ocessor 22. In the i^ 

register bank and internal data processing logic. Coprocessor instructions are identified 
by the common subset instruction decoder 12 and passed to the coprocessor together 
with control signals that determine when the coprocessor instruction is to be executed. 
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Some coprocessor instructions read all their operands from and write all their 
results to the coprocessor's internal register bank, and so require no further interaction 
with the processor core 3. Other coprocessor instructions require data values to be 
passed into the coprocessor 22. Possible sources for these data values include the 
register bank 4, the program status registers 18 and memory (not shown). In the 
illustrated apparatus, the processor core 3 obtains such data values from the desired 
location(s) and transfers them to the coprocessor 22 via the data interface D. Yet other 
coprocessor instructions transfer data values produced by the coprocessor 22 to 
destinations external to the coprocessor 22, such as the register bank 4, the program 
status registers 18 and memory. In the illustrated apparatus, such data values are 
transferred via the data interface D to the processor core 3, which writes them to the 
desired destination. 

It will be appreciated that many variations on this type of coprocessor 
operation are possible and encompassed within the present technique. For example, 
the coprocessor 22 might lack an internal register bank, so that its internal data 
processing logic always acts on data values transferred from the processor core 3 via 
the data interface D and sends its result values back to the processor core 3 via the data 
interface D. A second example is that the coprocessor 22 might have a separate 
interface to memory, so that it can load data values from memory and store them to 
memory without involving the processor core 3. In this case, the data interface P is 
only used for data values coming from or going to locations within the processor core 
3, such as the register bank 4 and the program status registers 18. Alternatively, the 
data interface D could be omitted entirely, so that data values can only be transferred 
between the processor core 3 and the coprocessor 22 by first storing them to memory 
from one and then loading them from memory into the other. 

It will be appreciated that in operation program instructions are fetched from 
memory addresses within a memory (not illustrated) and passed to the instruction 
prefetch buffer 14. When the program instructions reach the decode stage within the 
instruction prefetch buffer 14, the instruction decoders 12 decode these instructions 
and generates control signals which are applied to the processing logic within the 
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processor core 3, and the coprocessor 22 as necessary, to control these other elements 
to execute the data processing operation(s) specified. The processor core 3 is operable 
in a first mode in which a first instruction set is being decoded and in a second mode in 
which a second instruction set is being decoded. One way of indicating which mode 
5 the processor core 3 is in is to use a flag value within one of the program status 
registers 18. Depending upon which instruction set is currently active, the instruction 
decoders 12 will interpret the instructions received in accordance with the currently 
active instruction set and its encoding. In accordance with the present technique, the 
two instruction sets supported share a common encoding for a common subset of 

10 instructions, including at least one class of instructions, such as all unconditional 
coprocessor instructions, thereby enabling ready reuse of the same physical hardware 
to implement those common processing operations. It will be appreciated that the 
storage order of the program instructions from the different instruction sets may vary, 
such as due to endianness differences, instruction word size differences and the like, 

15 but the common subset of instructions share an encoding once such storage order 
differences have been compensated for as will be discussed hereafter. 

In this example embodiment, the processor core 3 supports two instruction sets, 
the ARM instruction set and an enhanced version of the Thumb instruction set (see the 
20 ARM Architecture Reference Manual for details of the existing ARM and Thumb 
instruction sets). The enhancements to the Thumb instruction set include a change to 
its BL and BLX instructions that makes it possible to add a substantial number of 32- 
bit instructions. 

25 Prior to the enhancement to the Thumb instruction set, its BL and BLX 

instructions consist of two successive halfwords in the instruction stream of the forms 
shown in Figure 2. In valid programs, inst^ction halfwords whose most significant 
five bits are 11101, 11110 or 11111 only appear in pairs of these two forms. All other 
instructions other than BL and BLX consist of a single halfword whose most 

30 significant five bits are in the range 00000 to 1 1 100. 
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An implementation can execute the two halfwords of a BL or BLX instruction 
in succession, and the combined effects of executing them in this manner produce the 
desired effect of the BL or BLX instruction, which is to perform a subroutine call to a 
Thumb subroutine (for BL) or an ARM subroutine (BLX). More precisely, it is to 
branch to a target address, switching over to the ARM instruction set if the instructs 
is BLX and to place a pointer to the instruction following the second halfword in 
register 14, which is the subroutine link register in the ARM/Thumb instruction set 
architecture. The subroutine called can return by branching to this pointer. 

Alternatively, an implementation can decode the two halfwords together as a 
32-bit instruction and perform the desired effect of the BL or BLX instruction directly. 
In the unenhanced version of the Thumb instruction set, this is an optimisation that 
allows the BL or BLX instruction to be executed more efficiently, but is not essential. 

The enhanced version of the Thumb instruction set instead requires that (under 
conditions to be described) two successive halfwords from the instruction stream are 
treated as a 32-bit instruction and not as two 16-bit halfwords to be executed in 
sequence With each other. Specifically, as shown in Figure 3, if the top 5 bits of the 
next halfword to be executed take any of the values 00000 through to 11100, it is 
treated as a 16-bit instruction. If they take one of the values 11101, 11110 and 11111, 
it and the following halfword are treated together as a 32-bit instruction. All existing 
BL and BLX instructions are therefore required to be treated as 32-bit instructions. 

All of the instructions of the unenhanced Thumb instruction set are still 
available in the enhanced version, BL and BLX as 32-bit instructions and all other 
instructions as _l&Wt msfructions. Comparing Figure 2 with Figure 3, it is apparent 
~^^^^P^P^'^'^^^' : ^ instructions are BL and BLX 
instructions. The other 32-bit instructions can be used in the enhanced version to 
provide additional functionality not present in the unenhanced version. In particular, 
the ARM instruction set contains many groups of instructions that are not available in 
the unenhanced Thumb instruction set, and the 32-bit instructions other than BL and 
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BLX can be used to provide Thumb equivalents of those instructions in the enhanced 
Thumb instruction set. 

Coprocessor instructions are an example of such a group of instructions: they 
are present in the ARM instruction set but have no equivalents in the unenhanced 
version of the Thumb instruction set. ARM instructions are 32-bit words; when 
bits[27:24] of such an instruction take one of the values 1100, 1101 and 1110, the 
instruction is a coprocessor instruction. Figure 4 shows the major groups of these 
instructions, which are: 

LDC instructions, which transfer data values from memory into the 
coprocessor; 

STC instructions, which transfer data values from the coprocessor to memory; 

MCR instructions, which transfer a data value from a processor core register 
into the coprocessor; 

MCRR instructions, which transfer two data values from processor core 
registers into the coprocessor; 

MRC instructions, which transfer a data value from the coprocessor to a 
processor core register; 

MRRC instructions, which transfer two data values from the coprocessor to 
processor core registers; 

CDP instructions, which perform operations within the coprocessor that do not 
require data values to be transferred in or out of the coprocessor; 

Undefined instructions, which cause an undefined instruction exception in the 
processor core 3. 
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All of these coprocessor instructions contain a cpnum field, which identifies 
the coprocessor for which the instruction is intended, and a cond field. The value of 
cond is 1110 or 1111 for unconditional coprocessor instructions, while values in the 
5 range 0000 to 1101 produce conditional versions of the equivalent instruction with 
cond = 1110. (The unconditional instructions with cond = 1111 do not have 
conditional versions.) 

All of these coprocessor instructions can be executed on data processing 
10 apparatus 2, using the techniques described above. 

Equivalent instructions to the unconditional ARM coprocessor instructions are 
encoded in the enhanced Thumb instruction set as shown in Figure 5. Comparing this 
with Figure 4, and noting that cond = 1110 or 1111 for unconditional ARM 
15 instructions, it will be appreciated that the encoding of each unconditional Thumb 
coprocessor instruction is identical to that of the equivalent ARM coprocessor 
instruction apart from storage order considerations. Furthermore, all of these 
unconditional Thumb coprocessor instructions have encodings in which HW1[15:1 1] = 
11101 or 11111, and so are 32-bit instructions according to the rule illustrated in 
20 Figures. 

The storage order considerations arise from the fact that ARM instruction 
words and Thumb instruction halfwords are normally stored in little-endian order, i.e. 
with their least significant byte at the lowest memory address. As an example, an 

25 ARM unconditional CDP instruction whose memory address is A will be held in the 
four bytes at addresses A, A+l, A+2 and A+3 as shown on the left hand side of Figure 
6. The equivalent instruction in the enhanced Thumb ihstruction set will be held with 
the first^lialfword HW1 at address A and the second halfword HW2 at address^A+2, 
and so will result in the instruction being held in the four bytes as shown on the right 

30 hand side of Figure 6. Comparing the two sides, the Thumb instruction consists of the 
same bytes as the ARM instruction, but in a different storage order: the bytes at 
addresses A and A+2 are swapped over relative to each other, as are the bytes at 
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addresses A+l and A+3. It will be appreciated that the same difference in storage order 
will exist for all of the unconditional coprocessor instructions, regardless of the precise 
instruction type. 

The enhanced Thumb instruction set is a variable-length instruction set, 
containing both 16-bit and 32-bit instructions. The first step in decoding such an 
instruction set is to identify the individual instructions in the instruction prefetch buffer 
14. In this embodiment, each entry in the instruction prefetch buffer 14 is a word- 
aligned word previously fetched from memory by the instruction prefetch unit 16. 
Denoting the entry at the head of the buffer as WORD1 and the next entry as WORD2, 
the instructions passed to the instruction decoders 12 are determined by the following 
rules: 



Current 

instruction 

set 


Instruction 
address 
mod 4 


WORD1 [15:11] 


W0RD1[31:26] 


Instruction 
to decoder 
bits[31:16] 


sent 

bits [15:0] 


ARM 


Always 0 


any 


any 


WORD1 [31:16] 


WORD1 [15:0] 


Thumb 


0 


00000-11100 


any 


0 


WORD1 [15 : 0] 


Thumb 


0 


11101-11111 


any 


WORD1[15:0] 


WORD1 [31:16] 


Thumb 


2 


any 


00000-11100 


0 


WORD1 [31:16] 


Thumb 


2 


any 


illOl-11111 


WORD1 [31:16] 


WORD 2 [15 : 0] 



This sends the correct instruction to the instruction decoder 12, with the storage 
order compensated for. It is always sent as 32 bits, with the most significant 16 bits 
being zero for a 16-bit Thumb instruction. It will be appreciated that for the Thumb 
instruction set, no separate bit is required to tell the instruction decoder 12 whether the 
insimction is aa6-^ 

32 bits sent to the instruction decoder 12 is always 0 for a 16-bit instruction and 1 for a 
32-bit instruction. 

It will be appreciated that processor core implementations vary considerably as 
regards how they prefetch instructions and how many instructions they decode at a 
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time, and therefore that this method of identifying the individual instructions will need 
to be modified according to these and other aspects of the processor core 
implementation. 

5 There are two common existing ways to implement the instruction decoders 12 

so that they can handle both the ARM and the Thumb instruction sets. In one, 
illustrated in Figure 7, a Thumb-to- ARM converter contains logic that converts each 
Thumb instruction into the corresponding ARM instruction. Multiplexors select the 
original instruction if ARM code is being executed, or the converted instruction if 

) Thumb code is being executed. The output of the multiplexors is then decoded by an 
ARM decoder. 

This form of the instruction decoders 12 can be modified to decode the 
unconditional coprocessor instructions that have been included in the enhanced Thumb 
instruction set, by making the Thumb-to-ARM converter pass the original instruction 
through unchanged to the multiplexor if bits[27:24] of the original instruction were 
1 100, 1 101 or 1 1 10 (these bits will always be 0000 for a 16-bit Thumb instruction, and 
some value other than 1100, 1101 or 1110 for a non-coprocessor 32-bit Thumb 
instruction). It will be appreciated that the enhanced Thumb instruction set may 
contain further 32-bit instructions besides BL, BLX and the unconditional coprocessor 
instructions; if it does, then further modifications will also be made to the Thumb-to- 
ARM converter to handle those instructions. 

The second common existing way to implement the instruction decoders 12 is 
to use separate decoders for the ARM and Thumb instruction sets, as illustrated in 
Figure 8. Multiplexors are then used to select the outputs of the ARM decoder if ARM 
code is being executed, or the outputs of the Thumb decoder if Thumb code is being 
* executed. 

This form of the instruction decoders 12 can be modified to decode the 
unconditional coprocessor instructions by splitting the ARM decoder into a part that 
decodes coprocessor instructions and a part that decodes non-coprocessor ARM 
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instructions, as illustrated in Figure 9. If the enhanced Thumb instruction set contains 
further 32-bit instructions besides BL, BLX and the unconditional coprocessor 
instructions, the non-coprocessor Thumb decoder is modified to handle them. The final 
multiplexors then select the outputs of the coprocessor decoder if bits[27:24] of the 
instruction are 1100, 1101 or 1110; otherwise, they select the outputs of the non- 
coprocessor ARM decoder if ARM code is being executed, or the outputs of the non- 
coprocessor Thumb decoder if Thumb code is being executed. 

In the first form of the instruction decoders 12, the common subset instruction 
decoder 13 is the part of the ARM decoder that decodes coprocessor instructions. In 
the second form, it is the coprocessor decoder. 

The use of a common storage order compensated encoding for the 
unconditional coprocessor instructions in the ARM and enhanced Thumb instruction 
sets has considerable advantages for both forms of the instruction decoders 12 
compared with the use of different encodings in each instruction set, in terms of 
reducing the amount of logic required and the amount of power consumed. A further 
advantage is that the coprocessor 22 only needs to be able to execute the same 
instructions as it could before the enhancement to the Thumb instruction set. As well 
as avoiding increases in the logic required by coprocessors and the power they 
consume, this also means that existing coprocessors do not need to be modified to be 
usable from the enhanced Thumb instruction set. 

It will be appreciated that the technique described above can be modified to 
perform the storage order compensation at other places within the design, such as 
within the Thumb-to-ARM converter in Figure 7, and that like any other logical 
" design, it can- be transformed into many logically equivalent, designs.^ 
technique encompasses all such modifications and transformations. 

Figure 10 schematically illustrates a general purpose computer 200 which may 
implement program instructions in accordance with the above described techniques. 
The general purpose computer 200 includes a central processing unit 202, a random 
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access memory 204, a read only memory 206, a network interface card 208, a hard 
disk drive 210, a display driver 212 and monitor 214 and a user input/output circuit 
216 with a keyboard 218 and mouse 220 all connected via a common bus 222. In 
operation the central processing unit 202 will execute computer program instructions 
that may be stored in one or more of the random access memory 204, the read only 
memory 206 and the hard disk drive 210 or dynamically downloaded via the network 
interface card 208. The results of the processing performed may be displayed to a user 
via the display driver 212 and the monitor 214. User inputs for controlling the 
operation of the general purpose computer 200 may be received via the user input 
output circuit 216 from the keyboard 218 or the mouse 220. It will be appreciated that 
the computer program could be written in a variety of different computer languages. 
The computer program may be stored and distributed on a recording medium or 
dynamically downloaded to the general purpose computer 200. When operating under 
control of an appropriate computer program, the general purpose computer 200 can 
perform the above described techniques and can be considered to form an apparatus 
for performing the above described technique. The architecture of the general purpose 
computer 200 could vary considerably and Figure 10 is only one example. 
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CLAIMS 

1. Apparatus for processing data, said apparatus comprising: 

data processing logic operable to perform data processing operations; and 
an instruction decoder operable to decode program instructions specifying data 
processing operations to be performed by said data processing logic and to control said 
data processing logic to perform said data processing operations; wherein 

said instruction decoder is operable in a first mode in which program 
instructions of a first instruction set are decoded and in a second mode in which 
program instructions of a second instruction set are decoded, a subset of program 
instructions of said first instruction set having a common storage order compensated 
encoding with a subset of program instructions of said second instruction set and 
forming a common subset of instructions representing at least one class of instructions, 
said common subset of instructions controlling said data processing logic to perform 
the same data processing operations independent of whether said instruction decoder is 
operating in said first mode or said second mode. 

2. Apparatus as claimed in claim 1, wherein said instruction decoder is operable 
to use common portions of said data processing logic to execute instructions of said 
common subset of instructions. 

3. Apparatus as claimed in any one of claims 1 and 2, wherein said common 
subset of instructions includes a class of instructions being coprocessor instructions 
operable to control coprocessor data processing operations using coprocessor logic 
common to said first instruction set and said second instruction set. 

4. Apparatus as claimed in claim 3, wherein all unconditional coprocessor 
instructions are within said common subset. 

5. Apparatus as claimed in any one of the preceding claims, wherein said first 
instruction set is a fixed length instruction set of N-bit instructions. 
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6. Apparatus as claimed in claim 5, wherein N is one of 32 or 1 6. 

7. Apparatus as claimed in any one of the preceding claims, wherein said second 
5 instruction set is a variable length instruction set. 

8. Apparatus as claimed in any one of the preceding claims, wherein at least one 
program instruction within said common subset of instructions performs common data 
processing operations in either said first mode or said second mode but generates 
different result data values depending upon whether said instruction decoder is 
operating in said first mode or said second mode. 



10 



9. Apparatus as claimed in claim 8, wherein said at least one program instruction 
generating different result data values includes a program counter value as an input 

15 operand. 

1 0. Apparatus as claimed in claim 9, wherein a different relationship is maintained 
between said program counter value and an address of an instruction being executed 
depending upon whether said instruction decoder is operating in said first mode or said 

20 second mode. 

1 1. Apparatus as claimed in claim 8, wherein said at least one program instruction 
generating different result data values includes a program status register value as an 
input operand. 

>5 

12. A method of processing data, said method comprising the steps of: 

performing data processing operations with data processing logic; and- 

decoding with an instruction decoder program instructions specifying data 

processing operations to be performed by said data processing logic and controlling 
0 said data processing logic to perform said data processing operations; wherein 

in a first mode program instructions of a first instruction set are decoded and in 
a second mode program instructions of a second instruction set are decoded, a subset 
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of program, instructions of said first instruction set having a common storage order 
compensated encoding with a subset of program instructions of said second instruction 
set and forming a common subset of instructions representing at least one class of 
instructions, said common subset of instructions controlling said data processing logic 
to perform the same data processing operations independent of whether said 
instruction decoder is operating in said first mode or said second mode. 

13. A method as claimed in claim 12, wherein common portions of said data 
processing logic are used to execute instructions of said common subset of 
instructions. 

14. A method as claimed in any one of claims 12 and 13, wherein said common 
subset of instructions includes a class of instructions being coprocessor instructions 
operable to control coprocessor data processing operations using coprocessor logic 
common to said first instruction set and said second instruction set. 

15. A method as claimed in, claim 14, wherein all unconditional coprocessor 
instructions are within said common subset. 

16. A method as claimed in any one of claims 12 to 15, wherein said first 
instruction set is a fixed length instruction set ofN-bit instructions. 

17. A method as claimed in claim 16, wherein N is one of 32 or 16. 

18. A method as claimed in any one of claims 12 to 17, wherein said second 
instruction set is a variable length instruction set. 

19. A method as claimed in any one of claims 12 to 18, wherein at least one 
program instruction within said common subset of instructions performs common data 
processing operations in either said first mode or said second mode but generates 
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different result data values depending upon whether said instruction decoder is 
operating in said first mode or said second mode. 

20. A method as claimed in claim 19, wherein said at least one program instruction 
generating different result data values includes a program counter value as an input 
operand. 

21 . A method as claimed in claim 20, wherein a different relationship is maintained 
between said program counter value and an address of an instruction being executed 
depending upon whether said instruction decoder is operating in said first mode or said 
second mode. 

22. A method as claimed in claim 19, wherein said at least one program instruction 
generating different result data values includes a program status register value as an 
input operand. 

23. A computer program product having a computer program operable to control a 
data processing apparatus containing data processing logic operable to perform data 
processing operations, said computer program comprising: 

program instructions of a first instruction set and program instructions of a 
second instruction set, that control said data processing logic to perform said data 
processing operations; wherein 

a subset of program instructions of said first instruction set have a common storage 
order compensated encoding with a subset of program instructions of said second 
instruction set and form a common subset of instructions representing at least one class 
of instructions, said common subset of instructions controlling data processing logic to 
perform the same data processing operations independent of whether instructions of 
said first instruction set or of said second instruction set are being decoded. 
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24. A computer program product as claimed in claim 23, wherein common 
portions of said data processing logic are used to execute instructions of said common 
subset of instructions. 

25. A computer program product as claimed in any one of claims 23 and 24, 
wherein said common subset of instructions includes a class of instructions being 
coprocessor instructions operable to control coprocessor data processing operations 
using coprocessor logic common to said first instruction set and said second 
instruction set. 

26. A computer program product as claimed in claim 25, wherein all unconditional 
coprocessor instructions are within said common subset. 

27. A computer program product as claimed in any one of claims 23 to 26, wherein 
said first instruction set is a fixed length instruction set of N-bit instructions. 

28. A computer program product as claimed in claim 27, wherein N is one of 32 or 
16. 

29. A computer program product as claimed in any one of claims 23 to 28, wherein 
said second instruction set is a variable length instruction set. 

30. A computer program product as claimed in any one of claims 23 to 29, wherein 
at least one program instruction within said common subset of instructions performs 
common data processing operations when instructions of either said first instruction set 
or said second instruction set are being decoded but generates different result data 
values. 

31. A computer program product as claimed in claim 30, wherein said at least one 
program instruction generating different result data values includes a program counter 
value as an input operand. 
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32. A computer program product as claimed in claim 31, wherein a different 
relationship is maintained between said program counter value and an address of an 
instruction being executed depending upon whether said instruction decoder is 
operating in said first mode or said second mode. 

33. A computer program product as claimed in claim 30, wherein said at least one 
program instruction generating different result data values includes a program status 
register value as an input operand. 

34. Apparatus for processing data substantially as hereinbefore described with 
reference to the accompanying drawings. 

35. A method of processing data substantially as hereinbefore described with 
reference to the accompanying drawings. 

36. A computer program product data substantially as hereinbefore described with 
reference to the accompanying drawings. 
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ABSTRACT 

INSTRUCTION ENCODING WITHIN A DATA PROCESSING APPARATUS 
HAVING MULTIPLE INSTRUCTION SETS 

A data processing apparatus 2 is provided which supports two instruction sets. 
These two instruction sets share a common subset of instructions including at least one 
class of instructions, such as all of the coprocessor instructions. The common subset 
of instructions have the same instruction encoding once any differences due to storage 
order within memory have been compensated for e.g. endianness. 



[Figure 1] 
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FIGURE 2 (PRIOR ART) : 

BL instruction: 



First half word (HW1) Second half word (HW2) 
15 11 10 0 15 11 10 0 
+ + + + + + 

| 11110 | xxxxxxxxxxx | | 11101 | xxxxxxxxxxx | 
+ + + + + + 

BLX instruction: 

First half word (HW1) Second half word (HW2) 

15 11 10 0 15 11 10 0 

+ + + + + + 

| 11110 | xxxxxxxxxxx | | 11111 | xxxxxxxxxxO I 
+ + + + + + 



FIGURE 3 : 

16-bit instruction if ABCDE is in range 00000 to 11100 

First/only halfword 

15 11 10 0 

\-K \ x ■ •■ ... 

+ „ + — + . 

| ABCDE | xxxxxxxxxxx | 

+ -4- + 

32 -bit instruction if ABCDE is 11101, 11110 or 11111: 

First halfword (HW1) Second halfword (HW2) 
15 11 10 0 15 0 

| ABCDE | xxxxxxxxxxx | | xxxxxxxxxxxxxxxx | 
+ + - - + + + 
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FIGURE 4 - (PRIOR ART) : 

ARM coprocessor instructions with bits [27: 24] = 1100 or 1101: 
31 28 27 25 24 23 22 21 20 19 16 15 12 11 8 7 4 3 0 
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Note: (P,U,W) != (0,0,0) on the LDC and STC rows. 
ARM coprocessor instructions with bits {27: 24] = 1110: 
31 28 27 24 23 21 20 19 16 15 12 11 8 7 5 4 3 0 
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FIGURE 5? 

Coprocessor instructions in the enhanced Thumb instruction set: 

With HW1[11:8] = 1100 or 1101: 

First half word (HW1) Second half word (HW2) 

. 15 12 11 9 8 7 6 5 4 3 0 15 12 11 8 7 4 3 0 

+ + + -- + -- + -- + -- + -- + + + + + + 

| lllx | 110 | 0| 0| 0| 0| x| xxxx | | xxxx | cpnum | xxxxxxxx | 
Undefined 

+ + +__+_-+__+__+__+ + + + + + + 

| lllx | 110 | 0 | 0 | X | 0 | 0 | Rn || Rd | cpnum | opc | CRm | MCRR 

+ + +__+__+__+__+__+ + + +, + + + 

| lllx | 110 | 0 | 0 | X | 0 | X | Rn | | Rd | cpnum | opc | CRm | MRRC 

+ + + + - - + --+-- + + + + + + + 

| lllx | 110 | P| U| N| W| 0| Rn | | CRd | cpnum | offset | STC 

+ + + --+--+ }■--+--+ + + + + + 

| lllx | 110 | P| U| N| W| 1| Rn || CRd | cpnum | offset | LDC 

+ + + -- + -- + -- + --+-- + + + + + + 

Note: (P,U,W) != (0,0,0) on the LDC and STC rows. 
With HW1{11:8] =1110: 

First half word (HW1) Second ha If word (HW2) 

15 12 11 8 7 5 4 3 0 15 12 11 . 8 7 5 4 3 0 

+ --- + + + + + + -- + + -- + + 

| lllx | 1110 | opcl | CRn | | CRd | cpnum | opc2 | 0| CRm | CDP 

+ --+------+ -- + -- + ; - -+ + + + - + - - + + 

| lllx | 1110 | opcl I 0| CRn | | Rd | cpnum | opc2 | l| CRm | MCR 

+ + + + - -+ h + + h +-- + + 

| lllx I 1110 | opcl | l| CRn | | Rd | cpnum | opc2 | l| CRm | MRC 

+ - + -- — + - - + --+ + + + +-- + -- + + 



FIGURE 6: 



ARM unconditional 
CDP instruction 
7 5 4 3 0 

+ + -- + - + 

| opc2 | 0 | CRm | 

+ +-- + + 

| CRd ! cpnum | 
+ + + 

| opcl | CRn | 
+ + + 

-| --.a^ixiia:^ |- 

+ + 



Byte 
address 



A 

A+l 
A+2 

-A+3 



Thiimb unconditional 
CDiP instruction 
7 5 4 3 0 
+ + + 

| opcl | CRn | ) 

+ + + ) HW1 

| 111x1110 | ) 

+ + -- + + 

| opc2 | 0 | CRm | ) 

+ + -- + + ) HW2 

- 1 - . CRd 1- -cpnum | - ) 

+ + + 
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FIGURE 7- (PRIOR ART) 
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FIGURE 8 (PRIOR ART) 
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FIGURE 9:- 
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